Evolutive Speaker Segmentation using a Repository System
نویسندگان
چکیده
When performing blind speaker segmentation one of the main problems is not knowing how many speakers appear in a conversation and wether they appear once or more than once. In this paper, an iterative method, which is based on the EvolutiveHMM is presented. Two main improvements to this system are introduced. On one hand, a repository generic speaker is used to model all utterances and all speaker models are derived from this iteratively. Different normalization of the scores are applied to the repository and the speakers to emphasize speaker changes. On the other hand, in all cases we use Gaussian Mixture Models (GMM) for their flexibility compared to an HMM structure. This method has been successfully tested using multi-speaker speech sequences generated by concatenation of speech segments from Speecon.
منابع مشابه
Evolutive speaker segmentation using a repository system
When performing blind speaker segmentation one of the main problems is not knowing how many speakers appear in a conversation and wether they appear once or more than once. In this paper, an iterative method, which is based on the EvolutiveHMM is presented. Two main improvements to this system are introduced. On one hand, a repository generic speaker is used to model all utterances and all spea...
متن کاملE-HMM approach for learning and adapting sound models for speaker indexing
This paper presents an iterative process for blind speaker indexing based on a HMM. This process detects and adds speakers one after the other to the evolutive HMM (E-HMM). The use of this HMM approach takes advantage of the different components of AMIRAL automatic speaker recognition system (ASR system: frontend processing, learning, loglikelihood ratio computing) from LIA. The proposed soluti...
متن کاملThe LIA-EURECOM RT‘09 Speaker Diarization System
This paper presents LIA-EURECOM’s joint submission to the NIST Rich Transcription 2009 (RT‘09) speaker diarization evaluation. We describe a number of modifications to our previous system which involve beamforming for the multiple distant microphone (MDM) condition and also significant enhancements to the speaker segmentation stage of the core speaker diarization system. These modifications lea...
متن کاملPolitecnico di Torino Porto Institutional Repository [ Proceeding ] Loquendo - Politecnico di Torino ’ s 2006 NIST Speaker Recognition Evaluation System
This paper describes the Loquendo – Politecnico di Torino system evaluated on the 2006 NIST speaker recognition evaluation dataset. This system was among the best participants in this evaluation. It combines the results of two independent GMM systems: a Phonetic GMM and a classical GMM. Both systems rely on an intersession variation compensation approach, performed in the feature domain. It all...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004